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ABSTRACT 



Lantran Chincode 
English language 



Presented in this paper is an illustration of the 
System for coding the Chinese language on ordinary 
typewriters or computer terminal keyboards. The key 
element of the Lantran coding is the use of the Pinyin phonetic^ 
romanization which has been officially standard in Communist China 
for the past 10 years. The end letters in the full coding for each 
character are the phonetic transcription of the pronunciation of that 
character in the Pinyin. Since many characters have the same 
pronunciation, the first part of the coding indicates as concisely as 
possible which particular character is intended. In general this 
means specifying the radical of the character and the number of 
additional strokes required, these two items together with the 
pronunciation being sufficient to correctly identify the written 
character in all but a few cases. The key board arrangement pictured 
is that of the Frieden Communications Terminal generating USASCII 
code for computer. (Authors/AMM) 
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Keyboard arrangement is that of Friden Communications Terminal generating USASCII code for 
computer. Punctuation locations will vary on other typewriters. a ..on mm i 
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THE LANTRAN CH INCODE SYSTEM 

For Ordinary English Language Typewriter 
or Computer Terminal Keyboard 



Key element of the Lantran coding is the use of the Pinyin 
phonetic romanization which has been officially standard in Com- 
munist China for the past ten years. The end letters in the full 
coding for each character are the phonetic transcription of the 
pronunciation of that character in the Pinyin. Since many charac- 
ters have the same pronunciation the first part of the coding 
indicates as concisely as possible which particular character is 
intended. In general this means specifying the radical of the 
character and the number of additional strokes required, these two 
items together with the pronunciation being sufficient to correct- 
ly identify the written character in all but a few cases . 

To illustrate: The pronunciation LIANG applies to a dozen 

or so commonly used characters. By referring to the keyboard 
illustration it is apparent that LIANG can be coded by striking 
only three keys : 

L = L 
I = I 

,ANG = 7, so LI7 will be the code ending. 

The first two elements will be the dictionary information 
of the radical, given in three digits, and the added strokes, 
given in one or two digits. Thus: 





Character 


Radical 


(measure) 




166 


(cold) 




085 


(two) 




Oil 


(bright) 




008 


(vehicles ) 




159 


(grain) 


W 


119 


(good) 




138 



Strokes 


Pinyin 


Full Code 


5 


LI 7 


1665LI7 


8 


LI 7 


0858LI7 


6 


LI 7 


0116LI7 


7 


LI 7 


0087LI7 


8 


LI 7 


1598LI7 


12 


LI 7 


11912LI7 


1 


LI 7 


1381LI7 



The end coding must be written in capital and lower case 
I letters accofirding to the location of the phonetic s 3 nnbols on the 
I keyboard illustration. Thus: DA for DA, Z for ZI and 18 for 

YING but KoSVfor KONG, x for SHI, and x6 for SHEN. (Capital N is 
used onJ.y for the. initiy^l sound.) The coding follows the Pinyin 
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particle 1® or , with other characters pronounced GE re- 

quiring the longer code to distinguish them. 

As the Lantran system comes into general use, particularly 
for computer-assisted translation from Chinese to English, short- 
coding possibilities will be developed until a substantial part 
of the language may be coded in Pinyin with conventionalized 
sjmibols identifying frequently used expressions and full coding 
required only for less common characters. 

Editin g and codin g sy mbols : Punctuation marks written 

between characters have been reserved in the Lantran system for 
use in pre-editing. They indicate choices among translations. 
When typed into coding for the computer the marks result in a 
different output. They are also useful, however, when the coding 
dictionary is used ordinarily for study and translation. 



A dash between characters or between two or more 

units of coding indicates that they are to be joined for 

translation. 

Thus: - y'C The dash indicates that 

the two characters are not to be translated separately as 
forei gn fire but in combination as matches . 



Single, double and triple quote marks (which may be called 
"as," "astu" and "astre") indicate additional translation 
choices, chiefly for verbs and adverbs. 



Single, double and triple commas ("al, altu and altre") 
indicate other translation choices, chiefly for nouns and 
adjectives, and for measures and other grammatical forms. 



L 



The above marks follow the character or coding they affect. 
The next mark goes ahead. 

This is the old Chinese sign (jt, put to a new use. (it may 
be called "iya") It indicates that the character or more 
often the group of characters following has been short- 
coded. The effect is to greatly extend the range of pho- 
netic coding. Aklti 

Thus^AliJ 0933TA, meaning i^, is reduced to 

LTA (iya TA) , distinguishing from the short-coded TA 
meaning he . 



L,KEx is short-coding for 0302KE-0725x . 

The L, indicates that this KE-SHI is the syntactical one 
meaning but , distinguishing it from the characters also 
pronounced KE-SHI but meaning corrosion, guestroom, lime- 
stone, silicon, classroom, etc. 













Radicals ' 


on Keyboard, 


coded by 


one stroke 


each 




1 


I 


22 


F 


32 


V 


52 


3 


85 


y 


118 


Y 


154 


) 


9 


9 


24 


+ 


37 


m 


60 


q 


86 


e 


120 


P 


157 


k 


10 


r 


26 


Z 


38 


w 


61 


f 


96 


( 


130 


V 


159 


% 


14 


M 


27 


H 


39 


z 


64 


t 


104 


m 

9 


140 


c 


162 


d 


18 


D 


28 


S 


40 


a 


66 


P 


109 


f 

« 


142 


• 

1 


167 


u 


19 


n 


29 


4 


43 


7 


72 


R 


113 


& 


145 


# 


169 


• 

• 


20 


B 


30 


g 


44 


X 


75 


h 


115 


$ 


149 


s 


181 
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in making distinctions among the CH sounds, , _ ^ 4^ 

’ thus q7 for CHANG 

QI4 for CHYOU f;jc , jW for ZHU ^ and JIE for JIE ^ coded 
039JIE, or zJIE, to distinguish from the radical main form , 

coded 39 or z. CW6 is CUN (ts'un in Wade) and Zo is ZUO 

Radicals on the ke y board : In order to shorten the coding, 

English letters and symbols on the typewriter keyboard have been 

selected to represent the radicals for well over half of all the 

Chinese characters. This cuts the radical part of the coding 

from three strokes to one. r, . j j ott-; • j ^ ..u 

So IS coded y8LI7 instead of the 

full code 0858LI7, and 

Ww is coded %8LI7 instead of the full code 
1598LI7. A radical on the keyboard when used by itself for a 
word is coded by a single stroke: m for 



037 



^ as big, I for 



001 “ as one and g for l-l 030 as mouth. Radicals not on the 

keyboard may also be coded by just the radical number without the 
pronunciation- -063, 110, 207, etc. 95W1 means radical 9 plus 5 
strokes pronounced WEI. Radical 95, not on the keyboard is coded 
095. 

Short-codin g; Chinese characters vary widely in frequency 
of use. Just as in English the expressions for is, does, has, I, 
she, with, go, today, this, that and so on make up much of the 
language. If a hundred or more such words are given the shortest 
possible coding, even with arbitrary combinations, frequent rep- 
etition will still make them easy to rero, ember. The coding labor 
for any body of text will be reduced by hundreds of strokes with 
a gain in speed and efficiency. Unlike such rigid lists as the 
Chinese telegraph code, the Lantran system can profit by such 
short-coding, usually keeping all or part of the Pinyin pronun- 
ciation as a memory guide. _ , , , 

For example is short-coded 

to WoM6 , only four strokes. The short-coding GE is used for the 
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